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1 Introduction 

The paper considers an extremal problem. Some of the most celebrated extremal results are those of Erdos 
and Stone [ES46| and Erdos and Simonovits |ES66|. They consider the following problem: Given m € N 
and a graph F, find the maximum number of edges in a graph on rn vertices that avoids having a subgraph 
isomorphic to F. 

There are a number of ways to generalize this to hypergraphs. A fc-uniform hypergraph is one in which 
each edge has size fc. Some view fc-uniform hypergraphs as the most natural generalization of a graph 
(a graph is a 2-uniform hypergraph) and one might also generalize the forbidden subgraph problem to a 
forbidden fc-uniform subhypergraph problem. There are both asymptotic results and exact bounds (e.g. 
lldCFOOI . rfPlkOSl . llFUr9T1l ). 

Forbidden Configurations is a different (but also natural) generalization that is studied mainly by 
Richard Anstee and his colaborators. We consider the following problem: Given m € N and a hy- 
pergraph F, find the maximum number of edges in a simple hypergraph H on m vertices that avoids 
having a subhypergraph isomorphic to F. 
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We find it convenient to use the language of matrices to describe hypergraphs: Each column of a {0, 1}- 
matrix can be viewed as an incidence vector on the set of rows. 

Definition 1.1. Define a matrix to be simple if it is a {0, l}-matrix with no repeated columns. Then 
an m x n simple matrix corresponds to a simple hypergraph (or set system) on m vertices with n 
distinct edges where we allow the empty edge. Let \\A\\ denote the number of columns in A (which is the 
cardinality of the associated set system). 

Definition 1.2. Let A and B be {0, l}-matrices with the same number of rows. Define the concatenation 

[A\B] to be the configuration that results from taking all columns of A together with all columns of B. 
For t £ N, we define the product 

t-A:= \A\A\ ••■ I A]. 



Our objects of study are {0, l}-matrices with row and column order information stripped from them: 

Definition 1.3. Two {0, l}-matrices are said to be equivalent if one is a row and column permutation of 
the other. This defines an equivalence relation. An equivalence class is called a configuration. 

Abusing notation, we will commonly use matrices (representatives) and their corresponding configura- 
tions interchangeably. 

Definition 1.4. For a configuration F and a {0, l}-matrix A (or a configuration A), we say that F is a 
subconfiguration of A, and write F -< A if there is a representative of F which is a submatrix of A. We 
say A has no configuration F (or doesn't contain F as a configuration) if F is not a subconfiguration 
of A. Let Avoid(m, F) denote the set of all m-rowed simple matrices with no subconfiguration F. 

Our main extremal problem is to compute 

forb(m, F) = max{||A|| : A € Avoid(m, F)}. 



Perhaps some examples are useful: 

) = m + 1, since we can take all columns with at most one 1. 



• forb(m, 

• forb(m, 



1 


set and the complete set). 



) = 2, since we may only take the column of l's and the column of 0's (i.e. the empty 



• forb(m, 



1 1 
1 1 



) = (T) + (?) + Co)' b y takin S a11 columns with at most two l's. The proof that 



this is indeed the maximum is easy and can be found in | Ragl 1 



Surveys on the topic can be found in [Ans| and |Ra gl 1) . Let A c denote the {0, l}-complement of A 
(replace every in A by a 1 and every 1 by a 0). Note that forb(m, F) = forb(m, F c ). 

Remark 1.5. Let F and G be configurations such that F -< G. Then forb(m, F) < forb(m, G). 
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We say a column a has column sum t if it has exactly t ones. Define O m to be a column with m O's 
and 1 TO to be a column of m l's. 

For a set of rows S, we let A\s denote the submatrix of A given by restricting the rows of A to only 
those in S. 

An important general result due to Fiiredi applies to simple or to non-simple configurations. 

Theorem 1.6 (Z. Fiiredi). Let F be a given fc-rowed {0, 1}— matrix. Then forb(m, F) is 0(m k ). 

We desire more accurate asymptotic bounds. Anstee and Sali conjectured that the best asymptotic 
bounds can be achieved with certain product constructions. 

Definition 1.7. Let A and B be {0, l}-matrices. We define the product A x B by taking each column of 
A and putting it on top of every column of B. Here is an example of a product: 



A = 



1 1 
1 



B = 



1 
1 



.4 

x = 
B 









1 


1 


1 


1 ' 














1 


1 


1 





1 





1 








1 





1 





1 



Note that this is a well defined operation in configurations. 

We are interested in asymptotic bounds for forb(m,F). Let X m be the m x m identity matrix, 1^ 
be the {0, l}-complement of I m (all ones except for the diagonal) and let T m be the tower matrix: a 
matrix corresponding to a maximum chain in the partially ordered set of the power set of the vertices. For 
example, 



r 4 



Anstee and Sali conjectured that the asymptotically "best" constructions avoiding a single configuration 
would be products of X, I c and T. 

Conjecture 1.8. [ AS05 1 Let F be a configuration. Let 



P r (a, b, c) := l r x ... xI r xl r c x...xl r c xT r x ... x 



b times 



Define X(F) to be the largest number such that there exist numbers a, b, c E N with a + b + c 
such that for all r G N, 

F ^ P r {a,b, c). 



X(F) 



Then forb(m, F) is 8(r 

Observe that X(F) is always an integer. Also note that ||P r (a, b, c)|| 



r a+b 



(r + l) c which is 

Q(r x ( p >), so by taking r — \m/X(F)~\ (and perhaps deleting some rows in case X(F) \ m), we have 
that ||P r (a, b, c)|| is f2(m x ( F - ) ), so the fact that forb(m, F) is Q,{m x ^ F ') is built into the conjecture. In 
order to prove the conjecture, all that would be required would be to prove that forb(m, F) is 0{m x ^) 
for every F. A disproof could be potentially easier, as only a counterexample would be required. 
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The conjecture has been proven for all fc x t configurations F with k € {1, 2, 3} and many others cases 
in various papers. The proofs for k = 2 are in HAGS97I . for k = 3 in IIAGS97I . MAFS01I . MAS05I . For 
£ = 2, the conjecture was verified in [AK06|. For k = 4, all cases either when the conjecture predicts a 
cubic bound for F or when F is simple were completed in IAF10I . For k = 4 and F non-simple, there 
are three boundary cases with quadratic bounds, one of which is established in [ARS12|. For k e {5, 6} 
some results can be found in [ARSH |. 

Anstee has long conjectured that even finding X(F) given F was not a trivial task, and the question of 
its NP-hardness was long conjectured but not proven ([ Ans |, |Ragl 1 1). In this paper we settle that finding 
X is indeed NP-hard and we also note that one of the decision versions associated with this optimization 
problem is NP-complete, adding this function to the long list of functions known to be NP -complete, with 
the interesting plus that this function is conjectured to give the exponent of the asymptotic growth of forb. 

For relatively small configurations F we have a computer program that yields the answer (relatively) 
quickly. The source code (in C++) can be freely downloaded from: 

|http: / /matmor .unam.mx/~mraggi/| 

This program can compute X(F) for F having less than ~10 rows in just a few minutes. This task 
takes merely exponential time, not doubly exponential (as it is often the case with forbidden configuration 
problems). This program was written to perform many other tasks other than finding X(F). A description 
of the algorithm used for this task is in the appendix|4] 

2 Results 

There are two natural decision problems associated with X(F): Given F and k as inputs, 

1. Is it true that X(F) < fc? 

2. Is it true that X(F) > fc? 

We prove that the first of the two decision problems is in NP by exhibiting a certificate which can be 
checked in polynomial time. 

The main result of this paper is the following: 

Theorem 2.1. Finding X(F) is an NP-hard problem. In other words, should a polynomial-time algo- 
rithm exist for finding X(F) given F, then every problem in NP could be solved in polynomial time. 
Furthermore, the problem of "given F and fc, is X(F) < fc?" is in NP. 

Before proving this theorem, we need a few lemmas. 

Observation 2.2. Let F be a configuration with n rows. Then X(F) < n. Indeed, assume for the sake 
of contradiction that a, b, and c are such that a + b + c = n + l and F -ft P r (a, b, c). We may place each 
row of F each into a different factor of the product. The extra matrix ensures the repeated columns of F 
get repeated as many times as needed. 

This observation in particular implies that if a polynomial time algorithm existed for any of the two 
decision versions of the problem, then we'd have a polynomial time algorithm for finding X(F), which 



together with Theorem 2.1 would make the decision version of finding X(F) an NP-complete problem. 



A simple (but surprising) corollary of conjecture 1.8 if it were true, would be that repeating columns 
more than twice in F has no effect on the asymptotic behavior of forb(m, F). In other words, assuming 
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the conjecture were true, the multiplicity of a column in a configuration would not affect the asymptotic 
bound and, asymptotically, it would only matter if a column is not there (has multiplicity 0), appears once 
(has multiplicity 1), or appears "multiple times" (has multiplicity 2 or more). Formally, 

Proposition 2.3. Let F t — [G\t ■ H] with G and H simple {0, l}-matrices that have no columns in 
common. Then X{F 2 ) = X{F t ) for all t > 2. In particular, if the conjecture were true, then forb(m, F t ) 
and forb(m, F 2 ) would have the same asymptotic behavior. 

Proof: It suffices to show that given t, G, H, a, b and c, there exists an R such that for every r > R, we 
have 

F 2 = \G\2-H] ~< P r (a,b,c) 4=^ F t = [G\t ■ H] ~< P r (a, b, c). 

Since F 2 -< F t , we only need to prove that if F 2 -< P r (a,b,c) for some r, then F t -< Pu(a,b,c) 
for some R. Suppose then F 2 is contained in the product P r (a, b, c) for some r. The idea is to find a 
subconfiguration of P r (a, 6, c) in which there are some columns with multiplicity 1, and for the columns 
with multiplicity 2 or more, the multiplicity depends on r. We need r large enough so that the multiplicity 
of any one column (with multiplicity of 2 or more) is larger than t. Let x be the number of rows of F t . 
Notice the following three facts, which include definitions for Ex, E%c and Ej-. 



E x (x, r) := [(r - x) ■ X | l x ] ~< l r 

E X c(x,r) := [(r-x) ■ l x \l c x ] <l c r 
r 



E T {x,r) := 



The first and second facts are easy to see; just take any subset of x rows from T r or Z£. The third 
statement is true by taking the L^/a;J-th row of T r , the 2[r/a;J-th row of T r , etcetera, up to the x[r/a;j-th 
row. For example, if r = 5 and x = 2, we may take the second and fourth row from Tg: 



r 5 



11111 

1111 

1 1 1 

1 1 

1 



T 5 



5\{2A} 



1111 
1 1 



E T (2,5) 



Note that in the three configurations Ex(x, r),Exc(x, r) and Ej-(x, r), we have that there are some 
columns of multiplicity 1 and there are some columns for which their multiplicity can be made as large 
as we wish by making r large. Formally, let E(x, r) be one of Ex(x, r) or Ex^ (x, r) or Ej-(x, r). We 
have that for every x-rowed column a there are three possibilities: either X(a, E(x, r)) = for all r, or 

X(a, E(x, r)) = 1 for all r, or lim X(a, E(x, r)) = oo. 

r— >oo 

If a is a column for which lim A(a, E(x, r)) = oo, we may conclude that there is an R for which 

1 — >oo 

X(a, E(x, r)) > t for every r > R. 

Since F 2 is contained in P r (a, b, c) for some r, the columns in H will have multiplicity at least 2 in 
some subset of the rows of P r (a, b, c). We see that F t is also a subconfiguration of Pr(o, b, c). □ 
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3 Proof of the Main Theorem 

We now prove the main theorem. 

Proof: First we prove that the decision problem has a certificate which can be checked in polynomial 
time. A certificate that indeed X(F) < k would have to be a proof that F -< P r (a, b, c) for each triple 
a, b, c for which a + b + c = k. Note that there are at most a quadratic (with respect to the number of 
rows) number of a, b, c's which satisfy the equation, since the question has a trivial "yes" answer when k 
more than the number of rows (Observation |2.2| >. 

Given F and A configurations, one can easily construct a certificate that a configuration F is indeed a 
subconfiguration of a configuration A: explicitly state which permutation of F appears in exactly which 
rows and columns of A, For the case A — P r (a, b, c), a certificate only needs to specify which rows of F 
go inside which factors, so at most a quadratic number of these certificates-of-being-a-subconfiguration 
suffice. 

We now prove that finding X(F) is NP-hard. Suppose there existed some polynomial-time algorithm 
that finds X(F) given F, We shall prove that there would then exist a polynomial time algorithm for 
GRAPH COLORING. Suppose we are given a graph G and we wish to find the minimum number of 
colors for which there exists a good coloring of the graph. We may assume no isolated vertices. 

The idea is to construct a 3-part matrix F(G) in which the first two parts ensure there is no T orI c in a 
maximum product of the form P r (a, b, c) with no subconfiguration F(G), and the last part is constructed 
so that a partition into X's produces a partition of the vertices of the graph into independent sets and 
vice-versa. 

Suppose G has n vertices and e edges. Let M be a large number with M > n + 2 and let S be the 
incidence matrix of G (i.e., the edges of G are encoded as columns with two l's corresponding to the 
vertices that belong to the edge). Construct the following simple matrix: 





1 




F(G) := 


1 


Tm 




S 






Clearly we can construct F(G) in polynomial time (with respect to the number of vertices of G). We 
prove now that we have x(G) = X(F) — 2M + 1, which in turn would yield a polynomial time algorithm 
for GRAPH COLORING, provided we had a polynomial time algorithm for X(F). 

Now, let us study the possibilities for a product of type P r (a, b, c) that does not have F(G) as a subcon- 
figuration for any r. If b ^ 0, then we could place all of [1|2^] in the X c part of P r (a, 6, c), so o + b + c 
would be at most 1 + M + n (using Lemma 2.2 1. The same is true when c ^ 0. But if we let b = c = 0, 
P r (a, 0, 0) is just a product of X's, so let us calculate how many l's we can multiply together and still not 
create a subconfiguration F(G). 

In order for F(G) to be a part of a product of X's, every row of [1 and [1 |7m] must be in a separate 

in X (and also separate from the rows of [S"|0], since we are assuming G 



factor X, since there is no 
has no isolated vertices). 
Then two rows of the [5|0 



part can be in the same X if and only if there is no 



in those two 



rows, which, in terms of the graph, means there is no edge between those two vertices. In other words, 
partitioning [5|0] into Fs is equivalent to partitioning the vertices of G into independent sets. So if the 
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graph G cannot be colored with x(G) — 1 colors and this is the maximum, this means that X(F) = a = 
2M + x(G) - 1 > n + M + 1. Then X (G) = X(F(G)) - 2M + 1. □ 
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4 Appendix: Algorithm to find X(F) 



In this section we describe the algorithm used by the software described in the introduction. It runs 
in exponential time, of course, but it has been helpful for finding boundary configurations with given 
asymptotic bounds (see [ ARS 1 1 1, | Rag 1 1 1 and |Ans|). Perhaps this program might be used to find a 
counter example to the Anstee-Sali conjecture, provided one exists, and it isn't very large. 

4. 1 Representation of a Configuration 

We are interested in an efficient representation for configurations in order to perform the tasks described 
above. In the progress of our investigations, we have had various versions of the program. 

Most of what we want the program to do involves performing a huge number of configuration compar- 
ison operations, which is testing whether or not a configuration F is a subconfiguration of a configuration 
A. As a first approach it would seem as if, for this task, we would be required to test each row and column 
permutation of F against each submatrix of A. This is of course a very slow way to do this. A simple trick 
to speed up the computations is to keep the columns of a configuration always in some canonical order. 
Then, to test whether or not a configuration F is contained in another A, we just need to permute rows of 
F and take subsets S of rows of A and place the columns of A\s in canonical order. 

Most of the tasks we wish this program to perform involve checking whether or not a given (fixed) con- 
figuration F is a subconfiguration of a vast number of configurations A. In particular, any pre-processing 
we do on F can be considered as almost free. For example, finding all row permutations of F and storing 
them would need to be done once for each configuration F, and not at all for configurations A. 

After many attempts and experiments, it seemed that the best (fastest) way to store a configuration that 
made many of the other tasks reasonably fast is this: Maintain an array of integers where the indices of the 
array, written in binary, are the columns of the configuration, and the actual numbers of the array represent 
the number of times a column appears. That is to say, a configuration F in m rows is represented by an 
array (C++ vector) F of size 2 m . For a number a, consider the binary representation of a and consider 
it as a column with m rows. If necessary, put enough O's at the beginning of the binary representation in 
order to have the required m bits. The number ¥[a] (the a-th number of the array) represents the number 
of times that column a appears in configuration F. In the implementation, we use an array of unsigned 
characters instead of integers, since we never need a configuration with the same column repeated more 
than 255 times. An unsigned character consists of 1 byte (8 bits). 

For example, the array F = [1, 0, 0, 2, 0, 1, 0, 1] represents the following configuration (notice it has 3 
rows, since the array has size 8 = 2 3 ): 



To see this, remember we start from 0. There is a one in position = 000b, meaning the colum 
(0, 0, 0) T gets repeated one time. A two in position 3 = 01 lb, a one in position 5 = 101b and a one in 
position 7 = 111b- The columns of this matrix are the representations of these numbers in binary form. 

An observant reader might complain that this has the disadvantage that it requires storing 2™ bytes, and 
if F doesn't have many columns, most of those will be O's. But it's a minor disadvantage, because even 
at 10 rows we would only need 1024 bytes, and we usually have configurations for which the number of 
rows is 5 or less (32 bytes). Perhaps this would become more of an issue with configurations with a high 



F = 



1 1 
110 1 
1111 
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number of rows, but for those configurations, most of our tasks would require too much time to be of any 
practical use anyway. 

We came to this representation after an implementation which represented columns as an array of bits 
(C++ bitset) and storing them into an ordered tree-like structure (C++ multiset from the STL). This might 
be a more natural implementation, but profiling the code made clear that the program was spending most of 
its time counting how many columns of a certain type appeared in a configuration, and was also spending 
a considerable amount of time navigating the tree. Explicitly storing the number of times each column 
appears, and making that number instantly accessible by storing it in an array (for random access) gives a 
very noticeable speedup and allows us to consider larger problems. By representing columns as numbers, 
we can do a lot of preprocessing and compute large tables in which we have almost instant access time. 

For example, consider the following problem, which has to be done many times for our tasks: Given 
a column a and a subset S (represented by an integer as written in binary), what column is as? This is 
relatively slow to compute, but we can fill out a table by preprocessing to speed up any further access to 
it. Since we do this a few million times, the investment is sound. 

The other advantage is that it becomes immediately clear how to compare two configurations with the 
same number of rows to see if one is a column-permutation submatrix of the other; check if for any column 
(index) the integer at position c of the first array is bigger than that of the second array. To check if F is 
a subconfiguration of A, we would need to find all permutations of the rows of F (which we need to do 
just once per configuration). 

4.2 Subconfigurations 

Suppose F and A are configurations and we want to decide if F -< A. Then for every s-tuple of rows S 
(where s is the number of rows of F), we can extract from A the configuration A\s easily with our pre- 
stored table of columns and subsets. Once we've done this for each column of A and found A\g, then for 
each permutation of rows of F, we check if every column a in the array corresponding to configuration F 
appears less than or equal to the corresponding number for column a in the array of A\g, We can check 
every subset S like this. If at any point this is so, we can return true. 

There are a few speedups. Sometimes it's immediately obvious a configuration can't be contained in 
another. For example, if there are more l's in F than in A, or if F has more columns or rows than A, then 

4.3 Determining X(F) 

Given a configuration F, we wish to find X(F). In other words, we wish to find the conjectured asymp- 
totic bound for forb(m, F). 

We may make a simplification using Proposition 2.3 and assume the multiplicity of any column of F is 
at most 2. 

First, suppose we wanted to test whether or not there exists r such that configuration F is contained in 
the product 

P r (a, b, c) = I r x ... x I r x 1° x ... x I£ x T r x ... x T r . 

a times b times c times 

Building this object with r = R as calculated in Proposition |2 . 3 1 would be prohibitively slow. Instead, 
we build a set X of subconfigurations from Pr{cl, b, c) such that if F -< Pr{cl, b, c), then F -< X for 
some X 6 X. 
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Recall that if F -< Pr(ci, b, c), then the rows of F get partitioned into a + b + c parts (a part can be 
empty), where each part belongs to a factor of the product b, c). Because of Proposition 2.3 we 

can assume each column appears at most twice. Given seN, consider the following matrices: 

Aj(s)=[t-0 s I,], A Ia (s)=[t-l s I<] , A T (s)=t>[T.]. 

We see that an s-rowed configuration F (with each column repeated at most t times) is contained in /„, 
for some large m, if and only if F -< Aj(s). We can then consider all partitions of rows of F and see if 
each part is contained in the corresponding Aj, Aj^ or At. 

For example, to test whether 



F 



is contained in / x T x T, we would partition the rows of F in three parts. In this case, F has five rows, 
so consider, for example, the following partition of 5: (2, 2, 1). Consider the following representatives: 



Aj(2) 





"0" 






"1 


0" 




s ■ 












1 





A T (2) 



1 1 
1 



A T (1) 



[0 1] 



and build a 5-rowed matrix A := Aj(2) x A T (2) x A T (1). If F -< A, then F -< I x T x T. If we do 
this for every possible partition of 5, we get the desired result. 

To find X(F) we can build a tree of possibilities of products and prune whenever we hit a node that 
already has configuration F: 




TxT 



Ixlxl 



We return the product with the largest number of factors for which F is not a subconfiguration of a 
product of this form, observing that if F is a subconfiguration of a product P r (a, b, c), then it will be a 
subconfiguration in any product P r (a' , b',c') with a' > a, b' > b and d > c. 

4.4 Finding Boundary Cases and Classifying Configurations 

Given number of rows s and a number k, we wish to find all boundary cases, that is maximal and minimal 
F such that X(F) = k. We make use of the program described in the previous section to find X(F) for 
many different F. The boundary cases for small s and k can be found in BAnsI and |Ra gl 1) . 



Finding the number predicted by the Anstee-Sali Conjecture is NP-hard 
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The method we use is very straightforward: start adding columns, one by one. Build a tree of configu- 
rations, where the children of a configuration F are the ones that consist of F plus a column. Find X(F) 
for each configuration in the tree. Store all those configurations F for which X(F) = k, and then find 
only the maximal and minimal configurations in the ordering -<. If at some point we add a column and 



the bound jumps to k + 1 or higher, discard and go to the next configuration. Because of Proposition 2.3 
we only need to consider cases where columns are repeated at most twice. 

Unfortunately this method is very slow because the same configuration is searched multiple times, 
since each configuration may have many representatives. To get rid of repetition we might check for 
equivalence of configurations against everything we have stored so far. But checking if two configurations 
are equivalent is usually slow, so doing it every time is also very slow. 

What seems to work best is to do check for equivalence, but only up to a point. For example, we can 
consider all pairs of columns, test those and only take one representative of each equivalence class. Then 
from each pair, start building the tree as described above. After that, it will be relatively unlikely that 
two configurations we search are equivalent, so the amount of repetition will be relatively low. Of course, 
much repetition will still occur, but much less than in the original tree. 

Notice that it isn't as critical that classifying configurations be a fast calculation. We need to do it once 
for every s and k, but no more. Once we know the maximal and minimal quadratics for five rows, we 
never need to calculate them again. Other calculations, such as finding X(F) or What Is Missing, have to 
be performed for each configuration (or family) we wish to study, so decreasing the running time is more 
of a priority in those cases since faster programs allow us to study bigger configurations. 



